fix(webapp,clickhouse): store task output as serialized String in ClickHouse #4092
Walkthrough
Task run output storage in ClickHouse is switched from the native JSON column to a plain …
🚥 Pre-merge checks | ✅ 4 | ❌ 1
❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
…ckHouse

Task run output is stored in a new `output_raw` String column instead of the native JSON `output` column. Deeply nested output could push the JSON column's accumulated type past ClickHouse 26.2's `input_format_binary_max_type_complexity` limit, failing the replication insert so the terminal row never landed and the run appeared stuck. A String has constant binary type complexity regardless of payload shape, so the failure mode is gone for both writes and reads.

TRQL path access compiles to `JSON_VALUE` over `output_raw`, and bare reads plus full-text search use the String column directly (a new ngram index keeps search fast). The TRQL surface is unchanged. `error` and the payload column are untouched.
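The write and read paths described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the PR's code: `serializeOutput`, `quoteLiteral`, and `compileOutputPath` are hypothetical names, the empty-string fallback for missing output is an assumption, and path segments are assumed to be simple identifiers.

```typescript
// Hypothetical helpers; the real logic lives in the PR's replication service
// and TSQL printer and may differ in detail.

/** Serialize a task output into the text stored in the String column. */
function serializeOutput(output: unknown): string {
  // Assumption: a missing (or non-serializable) output becomes an empty string.
  return JSON.stringify(output) ?? "";
}

/** Quote a value as a single-quoted ClickHouse string literal. */
function quoteLiteral(value: string): string {
  return "'" + value.replace(/\\/g, "\\\\").replace(/'/g, "\\'") + "'";
}

/** Compile a path such as ["user", "city"] into a JSON_VALUE call. */
function compileOutputPath(segments: string[]): string {
  for (const segment of segments) {
    // Assumption: only plain identifier segments are supported in this sketch.
    if (!/^[A-Za-z_][A-Za-z0-9_]*$/.test(segment)) {
      throw new Error(`Unsupported path segment: ${segment}`);
    }
  }
  const jsonPath = "$" + segments.map((s) => `.${s}`).join("");
  return `JSON_VALUE(output_raw, ${quoteLiteral(jsonPath)})`;
}
```

Because the stored value is always a String, its binary type is the same whether the payload is a scalar or a deeply nested object; only the read side needs to know the text is JSON.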
ba8f682 to 445e19f
🧹 Nitpick comments (1)
internal-packages/clickhouse/schema/035_add_task_runs_v2_output_raw.sql (1)
9-10: 🚀 Performance & Scalability | 🔵 Trivial | 💤 Low value
Ngram index won't cover pre-existing rows without materialization.
`ADD INDEX` only applies to data written after the DDL. Existing `task_runs_v2` rows (which still carry their output in the old columns and an empty `output_raw`) won't be indexed, but since reads now target `output_raw`, this only matters for historical rows that get backfilled. If you backfill `output_raw` for old data, consider `ALTER TABLE ... MATERIALIZE INDEX idx_output_raw` so search stays fast over historical rows.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Repository UI
Review profile: CHILL
Plan: Pro
Run ID: 3f3f1c6f-0e2c-428a-b676-5bfac1aa87d1
📒 Files selected for processing (12)
- .server-changes/clickhouse-output-string.md
- apps/webapp/app/services/runsReplicationService.server.ts
- apps/webapp/app/v3/querySchemas.ts
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/clickhouse/schema/035_add_task_runs_v2_output_raw.sql
- internal-packages/clickhouse/src/taskRuns.test.ts
- internal-packages/clickhouse/src/taskRuns.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/tsql/src/query/printer.ts
- internal-packages/tsql/src/query/schema.ts
📜 Review details
⏰ Context from checks skipped due to timeout. (24)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (9, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (10, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (4, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (5, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (8, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (6, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (7, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (3, 10)
- GitHub Check: internal / 🧪 Unit Tests: Internal (7, 12)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (2, 10)
- GitHub Check: webapp / 🧪 Unit Tests: Webapp (1, 10)
- GitHub Check: internal / 🧪 Unit Tests: Internal (8, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (12, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (3, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (1, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (4, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (10, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (11, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (6, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (2, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (9, 12)
- GitHub Check: internal / 🧪 Unit Tests: Internal (5, 12)
- GitHub Check: typecheck / typecheck
- GitHub Check: e2e-webapp / 🧪 E2E Tests: Webapp
🧰 Additional context used
📓 Path-based instructions (13)
internal-packages/clickhouse/schema/[0-9][0-9][0-9]_*.sql
📄 CodeRabbit inference engine (internal-packages/clickhouse/CLAUDE.md)
internal-packages/clickhouse/schema/[0-9][0-9][0-9]_*.sql: Migration file numbering: name files as `0(N+1)_descriptive_name.sql` where N is the largest existing migration number in `schema/`; rebase and renumber if main adds migrations before opening a PR
DDL in migrations must be idempotent: use `ALTER TABLE ... ADD COLUMN IF NOT EXISTS`, `CREATE TABLE IF NOT EXISTS`, `DROP TABLE IF EXISTS`, `ADD INDEX IF NOT EXISTS`, `DROP INDEX IF EXISTS`, and `CREATE MATERIALIZED VIEW IF NOT EXISTS` forms to allow out-of-order and retry-safe application
Files:
internal-packages/clickhouse/schema/035_add_task_runs_v2_output_raw.sql
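The two migration rules above (numbering and idempotent DDL) could be checked mechanically. The following is a rough sketch only: `nextMigrationName` and `findNonIdempotentDdl` are hypothetical helpers that do not exist in the repository, three-digit zero padding is inferred from the `[0-9][0-9][0-9]_*.sql` glob, and the regex is a simple heuristic rather than a SQL parser.

```typescript
/** Compute the next migration file name from the names already in schema/. */
function nextMigrationName(existing: string[], description: string): string {
  const numbers = existing
    .map((name) => /^(\d{3})_.*\.sql$/.exec(name))
    .filter((match): match is RegExpExecArray => match !== null)
    .map((match) => Number(match[1]));
  const next = (numbers.length > 0 ? Math.max(...numbers) : 0) + 1;
  return `${String(next).padStart(3, "0")}_${description}.sql`;
}

/** Return the DDL keywords that appear without an IF [NOT] EXISTS guard. */
function findNonIdempotentDdl(sql: string): string[] {
  const unguarded =
    /\b(?:ADD COLUMN|ADD INDEX|CREATE TABLE|CREATE MATERIALIZED VIEW)\b(?!\s+IF NOT EXISTS)|\b(?:DROP TABLE|DROP INDEX)\b(?!\s+IF EXISTS)/gi;
  return sql.match(unguarded) ?? [];
}
```

With `035_add_task_runs_v2_output_raw.sql` as the largest existing file, the first helper would propose a `036_` prefix; the second flags statements such as a bare `ADD INDEX` that would fail on a retry.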
**/*.{ts,tsx}
📄 CodeRabbit inference engine (.github/copilot-instructions.md)
**/*.{ts,tsx}: Use types over interfaces for TypeScript
Avoid using enums; prefer string unions or const objects instead
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
{packages/core,apps/webapp}/**/*.{ts,tsx}
📄 CodeRabbit inference engine (.github/copilot-instructions.md)
Use zod for validation in packages/core and apps/webapp
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- apps/webapp/app/services/runsReplicationService.server.ts
**/*.{ts,tsx,js,jsx}
📄 CodeRabbit inference engine (.github/copilot-instructions.md)
Use function declarations instead of default exports
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
**/*.{test,spec}.{ts,tsx}
📄 CodeRabbit inference engine (.github/copilot-instructions.md)
Use vitest for all tests in the Trigger.dev repository
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
**/*.ts
📄 CodeRabbit inference engine (.cursor/rules/otel-metrics.mdc)
**/*.ts: When creating or editing OTEL metrics (counters, histograms, gauges), ensure metric attributes have low cardinality by using only enums, booleans, bounded error codes, or bounded shard IDs
Do not use high-cardinality attributes in OTEL metrics such as UUIDs/IDs (envId, userId, runId, projectId, organizationId), unbounded integers (itemCount, batchSize, retryCount), timestamps (createdAt, startTime), or free-form strings (errorMessage, taskName, queueName)
When exporting OTEL metrics via OTLP to Prometheus, be aware that the exporter automatically adds unit suffixes to metric names (e.g., 'my_duration_ms' becomes 'my_duration_ms_milliseconds', 'my_counter' becomes 'my_counter_total'). Account for these transformations when writing Grafana dashboards or Prometheus queries
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
apps/webapp/**/*.{ts,tsx}
📄 CodeRabbit inference engine (.cursor/rules/webapp.mdc)
apps/webapp/**/*.{ts,tsx}: Access environment variables through the `env` export of `env.server.ts` instead of directly accessing `process.env`
Use subpath exports from `@trigger.dev/core` package instead of importing from the root `@trigger.dev/core` path
Use named constants for sentinel/placeholder values (e.g. `const UNSET_VALUE = '__unset__'`) instead of raw string literals scattered across comparisons
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- apps/webapp/app/services/runsReplicationService.server.ts
apps/webapp/**/*.test.{ts,tsx}
📄 CodeRabbit inference engine (.cursor/rules/webapp.mdc)
Do not import `env.server.ts` directly or indirectly into test files; instead pass environment-dependent values through options/parameters to make code testable
For testable code, never import `env.server.ts` in test files. Pass configuration as options instead (e.g., `realtimeClient.server.ts` takes config as constructor arg, `realtimeClientGlobal.server.ts` creates singleton with env config)
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
**/*.{ts,tsx,js,jsx,mts,cts,mjs,cjs}
📄 CodeRabbit inference engine (CLAUDE.md)
**/*.{ts,tsx,js,jsx,mts,cts,mjs,cjs}: Use `pnpm run typecheck` for changes in apps (`apps/*`) and internal packages (`internal-packages/*`), and never use `build` to verify those changes.
Use Vitest for tests, and never mock anything; use testcontainers instead.
Prefer static imports over dynamic `import()`, and only use dynamic imports for unresolved circular dependencies, genuine code-splitting needs, or conditional runtime loading.
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
**/*.{test,spec}.{ts,tsx,js,jsx,mts,cts,mjs,cjs}
📄 CodeRabbit inference engine (CLAUDE.md)
Place test files next to the source files they cover (for example, `MyService.ts` -> `MyService.test.ts`).
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
**/*.{ts,tsx,js,jsx,mts,cts,mjs,cjs,md,mdx}
📄 CodeRabbit inference engine (CLAUDE.md)
Always import from `@trigger.dev/sdk` when writing Trigger.dev tasks; never use `@trigger.dev/sdk/v3` or deprecated `client.defineJob`.
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
**/*.{test,spec}.{ts,tsx,js,jsx}
📄 CodeRabbit inference engine (AGENTS.md)
**/*.{test,spec}.{ts,tsx,js,jsx}: Write unit tests with Vitest, keep test files beside the files under test, and use descriptive `describe` and `it` blocks.
Avoid mocks or stubs in tests; when Redis or Postgres are needed, use the helpers from `@internal/testcontainers`.
Files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
apps/webapp/**/*.server.ts
📄 CodeRabbit inference engine (apps/webapp/CLAUDE.md)
apps/webapp/**/*.server.ts: Never use `request.signal` for detecting client disconnects. Use `getRequestAbortSignal()` from `app/services/httpAsyncStorage.server.ts` instead, which is wired directly to Express `res.on('close')` and fires reliably
Access environment variables via `env` export from `app/env.server.ts`. Never use `process.env` directly
Always use `findFirst` instead of `findUnique` in Prisma queries. `findUnique` has an implicit DataLoader that batches concurrent calls and has active bugs even in Prisma 6.x (uppercase UUIDs returning null, composite key SQL correctness issues, 5-10x worse performance). `findFirst` is never batched and avoids this entire class of issues
Files:
apps/webapp/app/services/runsReplicationService.server.ts
🧠 Learnings (22)
📚 Learning: 2026-05-14T14:54:39.095Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3545
File: .server-changes/agent-view-sessions.md:10-10
Timestamp: 2026-05-14T14:54:39.095Z
Learning: In the `trigger.dev` repository, do not flag inconsistent dot vs slash notation in route/path strings inside `.server-changes/*.md` files. These markdown files are consumed verbatim into the changelog, so the mixed notation (e.g., `resources.orgs.../runs.$runParam/...`) is intentional and should be preserved as-is.
Applied to files:
.server-changes/clickhouse-output-string.md
📚 Learning: 2026-03-22T13:26:12.060Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3244
File: apps/webapp/app/components/code/TextEditor.tsx:81-86
Timestamp: 2026-03-22T13:26:12.060Z
Learning: In the triggerdotdev/trigger.dev codebase, do not flag `navigator.clipboard.writeText(...)` calls for `missing-await`/`unhandled-promise` issues. These clipboard writes are intentionally invoked without `await` and without `catch` handlers across the project; keep that behavior consistent when reviewing TypeScript/TSX files (e.g., usages like in `apps/webapp/app/components/code/TextEditor.tsx`).
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-03-22T19:24:14.403Z
Learnt from: matt-aitken
Repo: triggerdotdev/trigger.dev PR: 3187
File: apps/webapp/app/v3/services/alerts/deliverErrorGroupAlert.server.ts:200-204
Timestamp: 2026-03-22T19:24:14.403Z
Learning: In the triggerdotdev/trigger.dev codebase, webhook URLs are not expected to contain embedded credentials/secrets (e.g., fields like `ProjectAlertWebhookProperties` should only hold credential-free webhook endpoints). During code review, if you see logging or inclusion of raw webhook URLs in error messages, do not automatically treat it as a credential-leak/secrets-in-logs issue by default—first verify the URL does not contain embedded credentials (for example, no username/password in the URL, no obvious secret/token query params or fragments). If the URL is credential-free per this project’s conventions, allow the logging.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-05-18T08:21:27.694Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3632
File: apps/webapp/sentry.server.ts:4-21
Timestamp: 2026-05-18T08:21:27.694Z
Learning: When handling Prisma error P1001 ("Can't reach database server") in TypeScript, don’t assume a single error shape. Prisma can surface P1001 via two different error classes/fields: `PrismaClientKnownRequestError` exposes it as `err.code === "P1001"` (common during mid-query connection drops), while `PrismaClientInitializationError` exposes it as `err.errorCode === "P1001"` (common on client startup failure). Therefore, predicates should use `err.code === "P1001" || err.errorCode === "P1001"`. Do not flag `err.code === "P1001"` as “unreachable/never matches,” as it is expected in production.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-05-18T08:21:27.694Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3632
File: apps/webapp/sentry.server.ts:4-21
Timestamp: 2026-05-18T08:21:27.694Z
Learning: When handling Prisma errors for P1001 ("Can't reach database server"), do not assume it only appears under a single property name. Prisma may surface P1001 via either `PrismaClientKnownRequestError` (`err.code === "P1001"`, e.g., mid-query connection drops) or `PrismaClientInitializationError` (`err.errorCode === "P1001"`, e.g., client startup connection failure). To reliably detect the condition, check `err.code === "P1001" || err.errorCode === "P1001"`, and avoid review rules that would incorrectly flag `err.code === "P1001"` as unreachable/never-matching.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
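The P1001 predicate from the two learnings above might look like the sketch below. `isDatabaseUnreachable` is a hypothetical name, and the function deliberately avoids importing Prisma's error classes, checking only the two fields the learnings mention.

```typescript
/** True when an unknown error carries P1001 in either `code` or `errorCode`. */
function isDatabaseUnreachable(err: unknown): boolean {
  if (typeof err !== "object" || err === null) return false;
  const candidate = err as { code?: unknown; errorCode?: unknown };
  // `code` is the PrismaClientKnownRequestError shape,
  // `errorCode` the PrismaClientInitializationError shape.
  return candidate.code === "P1001" || candidate.errorCode === "P1001";
}
```

Checking the fields structurally keeps the predicate usable on errors that have crossed a serialization boundary, where `instanceof` checks would no longer match.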
📚 Learning: 2026-06-13T19:53:13.759Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3937
File: packages/trigger-sdk/skills/realtime-and-frontend/SKILL.md:258-260
Timestamp: 2026-06-13T19:53:13.759Z
Learning: When reviewing code that uses `trigger.dev/react-hooks`’s `useRealtimeRun`, preserve the call signature where the first argument is the full realtime handle object (not `handle.id`). This is intentional to maintain type-safety and is consistent with the official docs; do not suggest changing the first argument from the handle object to `handle.id`.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-06-17T17:13:49.929Z
Learnt from: matt-aitken
Repo: triggerdotdev/trigger.dev PR: 3948
File: apps/webapp/app/routes/_app.orgs.$organizationSlug.projects.$projectParam.env.$envParam.bulk-actions.$bulkActionParam/route.tsx:48-62
Timestamp: 2026-06-17T17:13:49.929Z
Learning: In triggerdotdev/trigger.dev, within `dashboardLoader`/`dashboardAction` (or similar context resolver code) whenever you resolve an organization ID from an organization slug for RBAC/enterprise authorization scope, always read from the primary Prisma client (`prisma`), not `$replica`. Using `$replica` can hit replica-lag and cause the RBAC lookup/authorization to run without the correct org scope (bypassing intended role enforcement). Implement the slug→org lookup with `prisma.organization.findFirst(...)` (or equivalent primary-client query) and add an inline comment documenting why the primary client is required (replica lag could lead to unscoped RBAC checks).
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-06-23T13:04:21.413Z
Learnt from: carderne
Repo: triggerdotdev/trigger.dev PR: 4023
File: apps/webapp/app/services/upsertBranch.server.ts:14-18
Timestamp: 2026-06-23T13:04:21.413Z
Learning: In TypeScript, it’s valid to `import { type X }` and then use `typeof X` in a type-only position, e.g. `type Alias = z.infer<typeof X>`. The `type` modifier suppresses the runtime import, but the type checker still has the full exported type so `z.infer<typeof X>` can resolve correctly. In code reviews, don’t flag this as a TypeScript compile error as long as `typeof X` is used in a type context (e.g., with `z.infer`, `type` aliases, generics), not as a runtime value.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-05-07T12:25:18.271Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3531
File: apps/webapp/test/sentryTraceContext.server.test.ts:9-47
Timestamp: 2026-05-07T12:25:18.271Z
Learning: In the triggerdotdev/trigger.dev webapp test suite, it is acceptable to leave `createInMemoryTracing()` calls that register a global `NodeTracerProvider` without `afterEach`/`afterAll` teardown. Do not flag this as a test-ordering risk when the code follows the established pattern used across webapp tests (e.g., replication service/benchmark/backfiller tests). This is considered safe because `trace.getActiveSpan()` when called outside a `context.with(...)` block reads `AsyncLocalStorage.getStore()` (undefined when no `run()` scope exists), so it falls back to `ROOT_CONTEXT` with no attached span—regardless of which provider is registered.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
📚 Learning: 2026-05-28T20:02:10.647Z
Learnt from: myftija
Repo: triggerdotdev/trigger.dev PR: 3772
File: apps/webapp/test/findOrCreateBackgroundWorker.test.ts:1-1
Timestamp: 2026-05-28T20:02:10.647Z
Learning: In the triggerdotdev/trigger.dev monorepo, for the `apps/webapp` package use the established convention of storing Vitest tests (unit, integration, and e2e) under `apps/webapp/test/` rather than colocating them next to source files. Do not flag files located in `apps/webapp/test/` as violating any rule that says to colocate tests with source.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
📚 Learning: 2026-05-12T21:04:05.815Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3542
File: apps/webapp/app/components/sessions/v1/SessionStatus.tsx:1-3
Timestamp: 2026-05-12T21:04:05.815Z
Learning: In this Remix + TypeScript codebase, do not flag a server/client boundary violation when a file imports only types from a module matching `*.server`.
Specifically, it’s safe to import types using `import type { Foo } from "*.server"` or `import { type Foo } from "*.server"` because TypeScript erases type-only imports at compile time and they emit no JavaScript, so they won’t cross the Remix server/client bundle boundary.
Only raise the boundary concern for value imports (e.g., `import { Foo }` without `type`, or `import Foo`), since those produce JavaScript output.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- apps/webapp/app/services/runsReplicationService.server.ts
📚 Learning: 2026-06-25T18:21:51.905Z
Learnt from: carderne
Repo: triggerdotdev/trigger.dev PR: 4039
File: apps/webapp/app/routes/invite-revoke.tsx:0-0
Timestamp: 2026-06-25T18:21:51.905Z
Learning: During the Zod v4 migration in the triggerdotdev/trigger.dev webapp, ensure any imports from `conform-to/zod` use the Zod-4 subpath: `conform-to/zod/v4` (e.g., `import { parseWithZod } from "conform-to/zod/v4"`). Do not import from the package root `conform-to/zod`, because it is the Zod 3 implementation and may load Zod-3-only symbols (e.g., `ZodBranded`, `ZodEffects`), which can throw at module load (notably with `zod4.4.3`). This should be enforced across `apps/webapp/**/*` where helpers like `parseWithZod` and `conformZodMessage` are used.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- apps/webapp/app/services/runsReplicationService.server.ts
📚 Learning: 2026-05-18T14:40:02.173Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3658
File: packages/core/src/v3/realtimeStreams/manager.test.ts:1-147
Timestamp: 2026-05-18T14:40:02.173Z
Learning: In the triggerdotdev/trigger.dev repo, the policy “Never mock anything — use testcontainers instead” should only be enforced for integration tests that interact with real external services (e.g., Redis, Postgres) via actual infrastructure. For unit tests that exercise pure in-memory logic (e.g., cache semantics) it is OK to stub collaborators such as `ApiClient` using Vitest (`vi.fn()`) to assert call counts or control behavior. Do not flag `vi.fn()`-based `ApiClient` stubs in unit tests as violations of the testcontainers policy.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
📚 Learning: 2026-06-04T18:16:35.386Z
Learnt from: nicktrn
Repo: triggerdotdev/trigger.dev PR: 3836
File: apps/supervisor/src/backpressure/backpressureMonitor.ts:3-5
Timestamp: 2026-06-04T18:16:35.386Z
Learning: When reviewing TypeScript in this repo, apply the rule “prefer type aliases over interfaces” only to data/object shapes and union/intersection type modeling. If an interface is being used as a behavioral contract for collaborators to implement (e.g., method-shape interfaces that define required behavior, such as `BackpressureLogger` / `BackpressureSignalSource` in `apps/supervisor/src/backpressure/backpressureMonitor.ts`), keep it as an `interface` and do not flag it as a type-alias-vs-interface violation.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-06-09T17:58:04.699Z
Learnt from: 0ski
Repo: triggerdotdev/trigger.dev PR: 3879
File: apps/webapp/app/models/vercelIntegration.server.ts:619-630
Timestamp: 2026-06-09T17:58:04.699Z
Learning: In this codebase, outbound raw `fetch` calls should typically rely on Node/undici’s default request timeout (about ~300s) rather than adding a per-call `AbortController` + `setTimeout` wrapper inside individual functions (e.g. in files like `apps/webapp/app/models/vercelIntegration.server.ts`). During code review, do not flag the absence of a per-call timeout on a single `fetch` as an issue; if per-call timeouts are needed, they should be implemented via a codebase-wide convention (e.g., a shared fetch wrapper or documented pattern) rather than ad-hoc per-function changes.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- apps/webapp/app/v3/querySchemas.ts
- internal-packages/tsql/src/query/schema.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
- internal-packages/tsql/src/query/printer.ts
- apps/webapp/app/services/runsReplicationService.server.ts
- internal-packages/clickhouse/src/taskRuns.ts
📚 Learning: 2026-06-16T09:19:47.637Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3960
File: apps/webapp/test/prismaInfrastructureErrorCapture.test.ts:0-0
Timestamp: 2026-06-16T09:19:47.637Z
Learning: In this repo’s Vitest setup, `vitest.config.ts` uses `globals: true`, so identifiers like `vi`, `describe`, `it`, and `expect` are available as globals in Vitest test files. During code review, do not flag missing `vi`/`describe`/`it`/`expect` imports as a runtime error or correctness issue when they’re used in `*.test.ts/tsx` or `*.spec.ts/tsx` files. Explicit imports are still preferred for consistency, but they’re not required for runtime behavior.
Applied to files:
- apps/webapp/test/runsReplicationService.part3.test.ts
- internal-packages/clickhouse/src/taskRuns.test.ts
- apps/webapp/test/runsReplicationService.part6.test.ts
- internal-packages/tsql/src/query/printer.test.ts
- internal-packages/clickhouse/src/tsql.test.ts
📚 Learning: 2026-03-29T19:16:28.864Z
Learnt from: nicktrn
Repo: triggerdotdev/trigger.dev PR: 3291
File: apps/webapp/app/v3/featureFlags.ts:53-65
Timestamp: 2026-03-29T19:16:28.864Z
Learning: When reviewing TypeScript code that uses Zod v3, treat `z.coerce.*()` schemas as their direct Zod type (e.g., `z.coerce.boolean()` returns a `ZodBoolean` with `_def.typeName === "ZodBoolean"`) rather than a `ZodEffects`. Only `.preprocess()`, `.refine()`/`.superRefine()`, and `.transform()` are expected to wrap schemas in `ZodEffects`. Therefore, in reviewers’ logic like `getFlagControlType`, do not flag/unblock failures that require unwrapping `ZodEffects` when the input schema is a `z.coerce.*` schema.
Applied to files:
apps/webapp/app/v3/querySchemas.ts
📚 Learning: 2026-06-09T16:27:26.195Z
Learnt from: myftija
Repo: triggerdotdev/trigger.dev PR: 3878
File: apps/webapp/app/v3/services/computeTemplateCreation.server.ts:0-0
Timestamp: 2026-06-09T16:27:26.195Z
Learning: When working in triggerdotdev/trigger.dev code related to worker-group/region default resolution (e.g., defaultWorkerInstanceGroupId handling used by getGlobalDefaultWorkerGroup, getDefaultWorkerGroupForProject, and RegionsPresenter), do NOT add org-level featureFlags overrides in only one resolution site. That can cause template creation routing/decisions to diverge from actual run routing. If org-level override of the default region/worker group is required, it must be centralized in getGlobalDefaultWorkerGroup so every resolution path remains aligned.
Applied to files:
apps/webapp/app/v3/querySchemas.ts
📚 Learning: 2026-03-26T09:02:07.973Z
Learnt from: myftija
Repo: triggerdotdev/trigger.dev PR: 3274
File: apps/webapp/app/services/runsReplicationService.server.ts:922-924
Timestamp: 2026-03-26T09:02:07.973Z
Learning: When parsing Trigger.dev task run annotations in server-side services, keep `TaskRun.annotations` strictly conforming to the `RunAnnotations` schema from `trigger.dev/core/v3`. If the code already uses `RunAnnotations.safeParse` (e.g., in a `#parseAnnotations` helper), treat that as intentional/necessary for atomic, schema-accurate annotation handling. Do not recommend relaxing the annotation payload schema or using a permissive “passthrough” parse path, since the annotations are expected to be written atomically in one operation and should not contain partial/legacy payloads that would require a looser parser.
Applied to files:
apps/webapp/app/services/runsReplicationService.server.ts
📚 Learning: 2026-04-20T14:50:16.440Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3417
File: apps/webapp/app/services/sessionsReplicationService.server.ts:224-231
Timestamp: 2026-04-20T14:50:16.440Z
Learning: In Trigger.dev’s replication services (e.g., sessionsReplicationService.server.ts and runsReplicationService.server.ts), the “acknowledge-before-flush” behavior is intentional. The `_latestCommitEndLsn` should be updated at Postgres commit time and acknowledged on a periodic interval (via methods like `#acknowledgeLatestTransaction`) without waiting for ClickHouse batch flush to complete. Reviewers should not flag this as a durability/ordering bug; it is an established project-wide at-least-once delivery trade-off used across both runs and sessions replication services.
Applied to files:
apps/webapp/app/services/runsReplicationService.server.ts
📚 Learning: 2026-04-20T15:08:49.959Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3417
File: apps/webapp/app/services/sessionsReplicationService.server.ts:204-215
Timestamp: 2026-04-20T15:08:49.959Z
Learning: For replication services in `apps/webapp/app/services/*ReplicationService.server.ts`, keep the `ConcurrentFlushScheduler` deduplication key shape consistent across the related services (e.g., sessions vs runs) by using the same `${item.event}_${item.session.id}` / `${item.event}_${item.run.id}` pattern. If the key format ever needs to change (such as keying only by session/run id), make the update in all related replication services together—never in just one—so deduplication behavior stays aligned across services.
Applied to files:
apps/webapp/app/services/runsReplicationService.server.ts
📚 Learning: 2026-05-05T09:38:02.512Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3523
File: apps/webapp/app/routes/api.v3.batches.ts:178-181
Timestamp: 2026-05-05T09:38:02.512Z
Learning: When reviewing code that catches `ServiceValidationError` in `*.server.ts` files, do not blindly forward `error.status` to HTTP responses, because SVEs may be thrown with non-default statuses (e.g., 400/500) and forwarding them can cause client-visible behavioral regressions (e.g., surfacing 500s to clients). Prefer a safe default response status of `error.status ?? 422`, but only after confirming via the reachable call graph that the caught `ServiceValidationError` instances are expected to carry those non-default statuses; otherwise, normalize to `422` to avoid unexpected client-visible 5xx behavior.
Applied to files:
apps/webapp/app/services/runsReplicationService.server.ts
🔇 Additional comments (16)
internal-packages/tsql/src/query/schema.ts (1)
257-278: LGTM!
internal-packages/tsql/src/query/printer.ts (3)
2369-2374: LGTM!
2668-2678: 🎯 Functional Correctness | 💤 Low value
Edge: JSON keys containing `.`/special characters build ambiguous paths.
`buildJsonPath` joins string parts as `.${part}`. A leaf key that itself contains a dot (or another JSON-path-significant character) yields a path that `JSON_VALUE` will interpret as nested rather than as a literal key. SQL injection isn't a concern (the whole path is `escapeClickHouseString`-escaped), but path semantics can diverge for unusual keys. Bracket/quoted notation (e.g. `["foo.bar"]`) would be more robust if such keys are reachable via TRQL.
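To make the suggestion above concrete, here is a minimal sketch of a path builder that falls back to bracket/quoted notation for unusual keys. `buildJsonPathQuoted`, `PathPart`, and `SIMPLE_KEY` are hypothetical names for illustration, not the repo's `buildJsonPath`, and whether ClickHouse's `JSON_VALUE` accepts bracket-quoted keys is an assumption that would need checking against the target server version.

```typescript
// Hypothetical helper for illustration only; not the repo's buildJsonPath.
type PathPart = string | number;

// Keys matching this pattern are safe to emit in dot notation.
const SIMPLE_KEY = /^[A-Za-z_][A-Za-z0-9_]*$/;

function buildJsonPathQuoted(parts: PathPart[]): string {
  let path = "$";
  for (const part of parts) {
    if (typeof part === "number") {
      // Array index
      path += `[${part}]`;
    } else if (SIMPLE_KEY.test(part)) {
      path += `.${part}`;
    } else {
      // JSON.stringify adds the double quotes and escapes embedded
      // quotes and backslashes, so the key stays a single path segment.
      path += `[${JSON.stringify(part)}]`;
    }
  }
  return path;
}
```

With this shape, `["foo", "bar"]` and `["foo.bar"]` produce different paths, which is the ambiguity the comment points at. The resulting string would still need the existing SQL string escaping before being embedded in a query.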
2628-2662: 🎯 Functional Correctness
`rawColumn` and `dataPrefix` are not combined on these schemas. `output` uses `rawColumn` only, and `error` uses `dataPrefix` only.
> Likely an incorrect or invalid review comment.
internal-packages/clickhouse/src/taskRuns.ts (1)
31-31: LGTM!
Also applies to: 121-121, 191-191, 332-332
internal-packages/clickhouse/schema/035_add_task_runs_v2_output_raw.sql (1)
1-17: 🗄️ Data Integrity & Integration
Migration 035 is the next sequential number; no renumbering needed.
internal-packages/tsql/src/query/printer.test.ts (1)
944-1039: LGTM!
internal-packages/clickhouse/src/tsql.test.ts (1)
1611-1740: LGTM!
apps/webapp/test/runsReplicationService.part3.test.ts (1)
171-181: LGTM!
apps/webapp/test/runsReplicationService.part6.test.ts (1)
476-478: LGTM!
internal-packages/clickhouse/src/taskRuns.test.ts (2)
228-228: LGTM!
Also applies to: 286-286, 391-391, 504-504, 562-562, 624-624, 682-682
93-93: 🩺 Stability & Availability
`output_raw` is already last in both `TaskRunInsertArray` and `TASK_RUN_COLUMNS`.
apps/webapp/app/services/runsReplicationService.server.ts (2)
1378-1393: LGTM!
1108-1108: 🗄️ Data Integrity & Integration
No remaining native `output` readers
The TRQL schema already reads through the `rawColumn`/`JSON_VALUE` bridge, so the native `output` JSON column no longer needs separate consumers.
apps/webapp/app/v3/querySchemas.ts (1)
356-360: LGTM!
.server-changes/clickhouse-output-string.md (1)
1-7: LGTM!
The rawColumn bridge now compiles JSON path access to a JSONExtract expression that returns string scalars unquoted (so equality, LIKE and display match native scalar access) while returning object and array subtrees as raw JSON text. Missing keys yield an empty string.
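The result semantics described in this commit can be modelled in memory as a rough sketch. `bridgeValue` is a hypothetical name, and it emulates the described behaviour in TypeScript rather than reproducing the SQL the printer emits. Rendering numbers, booleans, and `null` as their JSON text is an assumption, since the commit message only specifies strings, subtrees, and missing keys.

```typescript
// Hypothetical in-memory model of the described bridge semantics.
function bridgeValue(raw: string, path: (string | number)[]): string {
  let node: unknown;
  try {
    node = JSON.parse(raw);
  } catch {
    // Unparseable text behaves like a missing key in this sketch (assumption).
    return "";
  }
  for (const part of path) {
    if (node === null || typeof node !== "object") return "";
    const container = node as Record<string | number, unknown>;
    // Missing keys (or out-of-range indexes) yield an empty string.
    if (!Object.prototype.hasOwnProperty.call(container, part)) return "";
    node = container[part];
  }
  // String scalars come back unquoted so equality and LIKE behave naturally.
  if (typeof node === "string") return node;
  // Objects and arrays come back as raw JSON text; other scalars are
  // rendered as their JSON text too (assumption, see lead-in).
  return JSON.stringify(node);
}
```

For example, with `{"user":{"name":"ada","tags":["a","b"]}}`, the path `user.name` gives `ada` without quotes, while `user.tags` gives the array as JSON text.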
343c8fe to
f45b3e3
Compare
When a JSON path on a String-backed column is compared against a numeric or boolean literal, extract the path as that type (JSONExtractFloat / JSONExtractBool) so the comparison is numeric/boolean with correct equality and ordering, rather than a string comparison that would either error on type mismatch or sort lexically. String literals and the LIKE family keep the string bridge.
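A minimal sketch of the selection rule described above, assuming the decision depends only on the operator and the literal's type. `pickExtraction`, `Extraction`, and the `"string-bridge"` tag are hypothetical names; only `JSONExtractFloat` and `JSONExtractBool` come from the commit message.

```typescript
// Hypothetical model of the extractor choice; not the printer's actual code.
type Literal = string | number | boolean;
type Extraction = "JSONExtractFloat" | "JSONExtractBool" | "string-bridge";

// The LIKE family always compares text, so it keeps the string bridge.
const LIKE_FAMILY = new Set(["LIKE", "NOT LIKE", "ILIKE", "NOT ILIKE"]);

function pickExtraction(operator: string, literal: Literal): Extraction {
  if (LIKE_FAMILY.has(operator.toUpperCase())) return "string-bridge";
  switch (typeof literal) {
    case "number":
      return "JSONExtractFloat"; // numeric equality and ordering
    case "boolean":
      return "JSONExtractBool"; // boolean equality
    default:
      return "string-bridge"; // string literals compare as text
  }
}
```

Keying on the literal rather than on the stored value keeps the choice static at compile time: the query text alone determines which extraction is used.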
BETWEEN on a String-backed JSON path with numeric bounds now extracts the path via JSONExtractFloat (matching the comparison operators), so range checks are numeric rather than lexical string comparisons.
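Why the distinction matters for range checks can be shown with a small sketch that compares the same values lexically and numerically. The function names are hypothetical, and this uses JavaScript string ordering as a stand-in for string comparison in general; for ASCII digits the ordering is the same as a bytewise comparison.

```typescript
// Range check on the text as-is: compares character by character.
function betweenLexical(value: string, lo: string, hi: string): boolean {
  return value >= lo && value <= hi;
}

// Range check after extracting the value as a number.
function betweenNumeric(value: string, lo: number, hi: number): boolean {
  const n = Number(value);
  return n >= lo && n <= hi;
}

// "50" is numerically inside 5..100, but lexically "50" > "100"
// because "5" sorts after "1".
const lexicalMiss = betweenLexical("50", "5", "100");
const numericHit = betweenNumeric("50", 5, 100);

// "9" is numerically outside 10..90, but lexically it sits between
// "10" and "90".
const lexicalFalseHit = betweenLexical("9", "10", "90");
const numericMiss = betweenNumeric("9", 10, 90);
```

Both failure directions appear here: a lexical check can drop rows that belong in the range and include rows that do not.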
Summary
Task run `output` is now stored as serialized JSON text in a new `output_raw` String column in ClickHouse, instead of the native JSON `output` column.

Arbitrary task output is unbounded in shape, and a sufficiently deep/wide payload could push the JSON column's accumulated type past ClickHouse 26.2's `input_format_binary_max_type_complexity` limit. When that happened the replication insert failed, the terminal `COMPLETED`/`FAILED` row never landed, and the run appeared stuck in an executing state in the runs list. The same ceiling also broke reads that selected the column back.

Fix
A `String` has constant binary type complexity regardless of how deep or wide the payload is, so the failure mode is eliminated for both writes and reads with no server setting involved.
- Replication writes the serialized output to `output_raw` and stops writing the deep object to the native JSON `output` column.
- TRQL path access (`output.foo`) compiles to `JSON_VALUE(output_raw, '$.foo')`; bare reads and full-text search use `output_raw` directly, backed by a new ngram index so search stays fast.
- `error` (read by the errors table) and the payload column are intentionally left as native JSON.

Verified end-to-end against a local ClickHouse: path filter/select, nested scalar paths, bare value reads, `IS NULL`, and full-text `LIKE` search all return correct results, alongside the existing replication test suite.
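The write side of the fix can be sketched roughly as follows. `toOutputRaw` and `nestingDepth` are hypothetical helpers, not functions from the replication service, and mapping a missing output to an empty string is an assumption. The point is only that however deep the payload is, the value handed to ClickHouse is a single string.

```typescript
// Hypothetical serializer for the String column; not the service's code.
function toOutputRaw(output: unknown): string {
  if (output === undefined) return ""; // assumption: no output means empty text
  // JSON.stringify can return undefined for non-JSON values such as functions.
  return JSON.stringify(output) ?? "";
}

// Measures how deeply objects/arrays are nested, to show that payload
// depth varies while the serialized column type does not.
function nestingDepth(value: unknown): number {
  if (value === null || typeof value !== "object") return 0;
  const children: unknown[] = Array.isArray(value)
    ? value
    : Object.values(value as object);
  return 1 + children.reduce<number>(
    (max, child) => Math.max(max, nestingDepth(child)),
    0
  );
}

const deepOutput = { a: { b: { c: { d: 1 } } } };
const serialized = toOutputRaw(deepOutput);
```

Here `nestingDepth(deepOutput)` grows with the payload, while `serialized` is always a plain string, which is why the type-complexity limit no longer applies to this column.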